Max-FTP: Mining Maximal Fault-Tolerant Frequent Patterns from Databases
نویسندگان
چکیده
Mining Fault-Tolerant (FT) Frequent Patterns in real world (dirty) databases is considered to be a fruitful direction for future data mining research. In last couple of years a number of different algorithms have been proposed on the basis of Apriori-FT frequent pattern mining concept. The main limitation of these existing FT frequent pattern mining algorithms is that, they try to find all FT frequent patterns without considering only useful long (maximal) patterns. This not only increases the processing time of mining process but also generates too many redundant short FT frequent patterns that are un-useful. In this paper we present a novel concept of mining only maximal (long) useful FT frequent patterns. For mining such patterns algorithm we introduce a novel depth first search algorithm Max-FTP (Maximal Fault-Tolerant Frequent Pattern Mining), with its various search space pruning and fast frequency counting techniques. Our different extensive experimental result on benchmark datasets show that Max-FTP is very efficient in filtering un-interesting FT patterns and execution as compared to Apriori-FT.
منابع مشابه
A Depth-First Search Approach for Mining Proportional Fault-Tolerant Frequent Patterns Efficiently in Large Database
Mining of frequent patterns in databases has been studied for several years. However, real-world databases contain noise and frequent pattern mining which extracts patterns that are absolutely matched is not enough. Therefore, a research field called fault-tolerant frequent pattern (FT-pattern) mining is proposed to deal with this problem. In this paper, we consider the problem of mining propor...
متن کاملAn Efficient Approach for Mining Fault-Tolerant Frequent Patterns Based on Bit Vector Representations
In this paper, an algorithm, called VB-FT-Mine (Vectors-Based Fault–Tolerant frequent patterns Mining), is proposed for mining fault-tolerant frequent patterns efficiently. In this approach, fault–tolerant appearing vectors are designed to represent the distribution that the candidate patterns contained in data sets with fault-tolerance. VB-FT-Mine algorithm applies depth-first pattern growing ...
متن کاملAn Efficient Algorithm for Proportionally Fault-Tolerant Data Mining
The mining of frequent patterns in databases has been studied for several years, but few reports have discussed fault-tolerant (FT) pattern mining. FT data mining is more suitable for extracting interesting information from real-world data that may be polluted by noise. This paper considers proportional FT mining of frequent patterns. The number of tolerable faults in a proportional FT pattern ...
متن کاملMFTPM: Maximum Frequent Traversal Pattern Mining with Bidirectional Constraints
An important application of sequential mining technique is maximal frequent traversal pattern mining, since users’ traversal pattern and motivation are latent in session sequence at some time segment. In this paper, a Frequent Traversal Pattern Tree structure with dwell time (FTP-Tree) is designed to store, compress the session database, and simplify the configuration of dwell time thresholds d...
متن کاملData sanitization in association rule mining based on impact factor
Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...
متن کامل